Ppdp-mlt: K−anonymity Privacy Preservation for Publishing Search Engine Logs

نویسنده

  • Rajani Devi
چکیده

In this paper we investigate the problem of protecting privacy for publishing search engine logs. Search engines play a crucial role in the navigation through the vastness of the Web. Privacy-preserving data publishing (PPDP) provides methods and tools for publishing useful information while preserving data privacy. Recently, PPDP has received considerable attention in research communities, and many approaches have been proposed for different data publishing scenarios. In this paper we study privacy preservation for the publication of search engine query logs. Consider an issue that even after removing all personal characteristics of the searcher, which can serve as links to his identity, the publication of such data, is still subject to privacy attacks from adversaries who have partial knowledge about the set. Our experimental results show that the query log can be appropriately anonymized against the specific attack, while retaining a significant volume of useful data. In this paper we study about problem in search logs and why the log is not secure and how to make log secure using data mining algorithm and techniques like Generalization, Suppression and Quasi identifier.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Layered Approach for Personalized Search Engine Logs Privacy Preserving

In this paper we examine the problem of defending privacy for publishing search engine logs. Search engines play a vital role in the navigation through the enormity of the Web. Privacy-preserving data publishing (PPDP) provides techniques and tools for publishing helpful information while preserving data privacy. Recently, PPDP has received significant attention in research communities, and sev...

متن کامل

Publishing Search Logs - A Comparative Study of Privacy Guarantees

Search engine companies collect the “database of intentions”, the histories of their users’ search queries. These search logs are a gold mine for researchers. Search engine companies, however, are wary of publishing search logs in order not to disclose sensitive information. In this paper we analyze algorithms for publishing frequent keywords, queries and clicks of a search log. We first show h...

متن کامل

Privacy in Search Logs

Search engine companies collect the “database of intentions”, the histories of their users’ search queries. These search logs are a gold mine for researchers. Search engine companies, however, are wary of publishing search logs in order not to disclose sensitive information. In this paper we analyze algorithms to publish frequent keywords, queries and clicks of a search log. How do their formal...

متن کامل

Privacy preserving data publishing: Review

Privacy preserving data publishing (PPDP) methods a new class of privacy preserving data mining (PPDM) technology, has been developed by the research community working on security and knowledge discovery. It is common to share data between two organizations in many application areas. When data are to be shared between parties, there could be some sensitive patterns which should not be disclosed...

متن کامل

Semantic microaggregation for the anonymization of query logs using the open directory project

Web search engines gather information from the queries performed by the user in the form of query logs. These logs are extremely useful for research, marketing, or profiling, but at the same time they are a great threat to the user’s privacy. We provide a novel approach to anonymize query logs so they ensure user k-anonymity, by extending a common method used in statistical disclosure control: ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012